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A FUNCTIONAL LIMIT THEOREM FOR 
THE PROFILE OF B-ARY TREES 

By Eva-Maria Schopp 

University of Freiburg 

In this paper we prove a functional limit theorem for the weighted 
profile of a fe-ary tree. For the proof we use classical martingales 
connected to branching Markov processes and a generalized version 
of the profile-polynomial martingale. By embedding, choosing weights 
and a branch factor in a right way, we finally rediscover the profiles 
of some well-known discrete time trees. 

1. Introduction. The profile is the set or sequence of numbers of nodes 
at each level of a tree. It is a fine tree shaped parameter related to many 
other important shape characteristics such as the total path length (the sum 
of the distances of all nodes to the root), depth (the distance of a random 
node to the root), height (the maximal distance of a node to the root), 
saturation level (the minimal distance of an external node to the root) and 
width (number of nodes at the most abundant level). 

In general, we distinguish between two main groups of trees. In the first 
group we collect all the trees by their height which increases through the 
square root of the number of nodes. The prime examples of these kind of 
trees are Galton- Watson trees conditioned on the total progeny or simply 
generated trees. It is shown in Aldous (1993) that the simply generated trees 
studied by Meir and Moon (1978) and conditioned Galton-Watson trees 
are the same. Moreover, the profile of conditioned Galton-Watson trees is 
further investigated in Drmota and Gittenberger (1997), Kersting (1998) 
and Pitman (1999) with further references. 

In the second group we collect all the trees with logarithmically growing 
height. In contrast to conditioned Galton-Watson trees, the profiles of these 
trees have received less attention in the past. However, there has been much 
done on this topic in the last few years. The methods used to derive limit 
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theorems for normalized profiles range from the method of moments, the con- 
traction method to analytical tools, including saddlepoint methods, Mellin 
transforms, Poissonization, de-Poissonization, singularity analysis, applica- 
tion of generating functions and uniform asymptotic analysis. We refer to Al- 
dous and Shields (1988), Drmota and Hwang (2005a, 2005b), Fuchs, Hwang 
and Neininger (2006), Hwang (2005), Devroye and Hwang (2006), Drmota, 
Janson and Neininger (2008), Park (2006) and Park et al. (2009). 

Furthermore, Chauvin, Drmota and Jabbour-Hattab (2001) and Chau- 
vin et al. (2005) used martingale methods to obtain limit theorems for the 
profile of the random binary search tree. It is a classical result of Jabbour- 
Hattab (2001) that the profile-polynom of the binary search tree, M„(z) := 
^T^^ Ui{n)z\ is a discrete time martingale where Ui{n) denotes the num- 
ber of nodes in generation / of a random binary search tree of size n [cf. (11)] 
and Cn{z) is a normalizing deterministic constant for z G C fixed [see (14) 
for a definition]. With this in mind, classical martingale convergence re- 
sults may now be applied to investigate the (asymptotic) behavior of this 
martingale and the asymptotics of the profile of the binary search tree [see 
Chauvin, Drmota and Jabbour-Hattab (2001)]. Instead of analyzing Mn{z) 
directly, Chauvin et al. (2005) showed that the discrete time martingale 
Mn{z) is deeply related to the well-studied classical Yule-time martingale 
M{t,z) of the corresponding continuous time tree, the Yule tree [cf. (9)]. Fi- 
nally, they strengthened classical convergence results in order to relate the 
uniform convergence of M(t, z) in compact sets to corresponding uniform 
asymptotic results of Mn{z). 

Apart from the profile of the binary search tree, the profile of the random 
recursive tree has been studied with martingale methods [cf. Drmota and 
Hwang (2005a)]. Drmota and Hwang (2005b) showed that the profile is not 
concentrated around the mean. Their proof is based on showing that the 
variance undergoes four phase transitions and exhibits a bimodal behavior 
in contrast to the unimodality of the expected value of the profiles. As a 
consequence, the profile is not concentrated around the mean. For example, 
around the most numerous level (where the width is attained) the variance 
is small [O(logn)] and the profile is concentrated with a Gaussian limit, but 
logn away from this level the variance increases, and there is no concentra- 
tion anymore [cf. Drmota and Hwang (2005a, 2005b), and further remarks 
in Example 7.4]. Recently, Sulzbach (2008) used the martingale method of 
Chauvin et al. (2005) to show a limit theorem for the profile of plane oriented 
recursive trees. 

Obviously not all trees have a martingale structure, and, furthermore, the 
method rapidly becomes costly when we leave the well-known binary search 
tree and recursive tree cases. The last statement is based on the fact that 
corresponding statements about other trees, and especially their continuous 
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time matches, are not available which differs from assertions about the well- 
studied Yule tree process. 

However, there is a large class of trees for which the martingale method 
gives uniform convergence results for the profile in any suitable compact set. 
In this paper we are interested in the kind of trees that can be studied with 
martingale methods. We want to find out how the method of Chauvin et al. 
(2005) can be generalized to derive asymptotic results for other trees besides 
the ones that have been studied so far. 

Broutin and Devroye (2006) constructed a class of continuous time edge- 
weighted trees and proved, using large deviation techniques, a general law 
of large numbers for the height of these trees. This class of edge-weighted 
trees leads, when the tree process is stopped correctly, to various well-known 
discrete time trees as, for example, to random binary search trees and to ran- 
dom recursive trees. The initial point of their model is the construction of an 
infinite, 6-ary tree where each node u in that tree is assigned, independently, 
a random vector, 

((zi"),<)),(z("\i?(")),...,(z("\£;("))), 

where, for instance, Ej^^ is the lifetime of n's jth child, and zj"^ is a specific 
weight assigned to the jth child of u [cf. (2)]. Let Exp(^) denote the exponen- 
tial distribution with the first moment equal to -,li> 0, in other words, the 

distribution with Lebesgue density f{x) = Aie~'^^']l[o^oo)(2;) for x G M. A con- 
tinuous time tree process is Markovian with a classical martingale structure 
if and only if the lifetimes are independent of each other and exponentially 
distributed (see, e.g., Harris [(1963), Chapter V.2], Watanabe (1967), Joffe, 
Le Cam and Neveu (1973), Athreya and Ney [(1972), Chapter III], King- 
man (1975), Biggins (1977), Wang (1980), Uchiyama (1982), Neveu (19887), 
Biggins (1991) and Biggins (1992)). Essentially, this follows from the mem- 
oryless property of the exponential distribution. 

Our purpose is now to show that these trees have the right martingale 
properties in order to generalize the method of Chauvin et al. (2005). The 
main theorem of this paper. Theorem 6.6, states that the normalized profile 
of a 6-ary weighted tree converges almost surely to a limit as the number of 
nodes in the tree tends to infinity. This limit can be identified as the almost 
sure limit of a discrete time martingale and, also, as the unique solution of 
expectation one of a fixed point equation. 

The plan for the rest of the paper is the following: First, in Section 2 we 
will introduce the tree model of Broutin and Devroye (2006) and define a 
closely related branching Markov process. Second, in Section 3 we will define 
the corresponding continuous time martingale associated with the branching 
Markov process of Section 2 and its discrete time analog in Section 4. Next, 
in Section 5 we will elaborate the relationship between those two martingales 
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and formulate our main result, Theorem 6.6 in Section 6. Finally, in Section 
7 we will show the applicability of our main Theorem 6.6 based on some 
examples as, for instance, the well-known random binary search tree, the 
random recursive tree, random lopsided trees and random plane oriented 
trees. 

2. The framework. In this section we describe the tree model of Broutin 
and Devroye (2006). Let Too be an infinite, complete 6-ary tree with b>2. 
We assign to each node a label, 

oo 

(1) neU:={0}u\J{l,...,br, 

n=l 

in the natural way; the root node, which will be denoted by 0, has b children 
which are called l,...,b. In the same manner, every node u has children 
denoted by ul, . . . , ub. Generally, ii u = ui- ■ - ui is a node, and v := vi - ■ -Vk 
is a sequence with Vj £ {1, . . . ,b},j = I, . . . ,k, we set 

UV = Ui - ■ ■ UlVl • • - Vk 

and call u an ancestor of uv. 

For each node u we create independently a random 6-vector, 

where for j = 1, . . . ,6, [Z^'^\ E^^^) is the vector assigned to the edge from 
node u to its jth child. Here, represents the weight of the edge from 

(u) 

node u to its jth child, and Ej is the lifetime of node li's jth child. 

Each couple {Zj^\Ej^^) is distributed as {Z,E) for independent Z and 

(u) (v) 

E. Note that the lifetime E- is independent of E)- for different u and 
V or different j and r. All S's are independent of any Z's, but we allow 
dependence of zj"^ , . . . , Z^'^ for any node Vj in the tree Too • 

We also assume that E is exponentially distributed with mean one to 
ensure Markov properties and that Z is a lattice distribution with values in 
Z'^ for some d S N. 

Remark 2.1. In Broutin and Devroye (2006) it is assumed that E and Z 
are nonnegative independent random variables where E is not mono-atomic, 
has no atom at zero and the following property holds: 

mi{x:P{E>x)>0} = 0. 

As mentioned in their concluding remark, their model and their proof of the 
law of large numbers for the height can be extended to more general cases 



FUNCTIONAL LIMIT THEOREM FOR THE PROFILE OF B-ARY TREES 5 



allowing dependence of E and Z as well as multi-dimensional versions. In 
Broutin, Devroye and McLeish (2008) the assumption that the components 
of the random vectors attained to each node are independent is skipped 
in order to obtain further height results of, for example, pebbled trees and 
others. 

Let 7r{u) be the set of edges from the root to node u. Further, let {Ze,Ee) 
be the couple of random variables assigned to edge e, and let 

(3) ft:=Lef^:Gu:= X] < t 

eg7r(«) 

be the subtree of Too consisting of the nodes that deceased before time t. 
We are interested in the external profile of Tt, 



(4) m--- 

eg7r(u) 

for teR+ and / G Z'^, where 

dTt := {u G Too : if u = • • • u„ for some n, 

then m • • • Un-i £ft,u^ ft} 

is the set of all external nodes or leaves in Tf. 

For this purpose we will study a closely related branching Markov pro- 
cess (a jump or step Markov process) {Tt)t>o which will be described next. 
We start the tree Tq with one particle, the root 0, which is alive at time 
t = and set accordingly Tq := {0}. This initial ancestor dies at a random 
time Ti where ti is exponentially distributed with mean one and bears b 
children. These children behave independently from and similarly to their 
ancestor. After the first birth the individuals are alive with in- 

dependent lifetimes equal to {e[^\ . . . , E^^^ ) , respectively. At time T2 := 
m.m{E^\ . . . , E^^^^} + Ti the corresponding individual deceases and gives 
birth to b new individuals, namely its children. Because of the memory- 
less property of the exponential distribution, all individuals alive just after 
T2 (namely the 6—1 remaining children of the root and the b children of the 
root's child that deceased at time T2) behave similarly to and independently 
from each other, having an exponentially distributed lifetime. We define Tt 
as the tree corresponding to the process described above when it is stopped 
at time t > 0. It is clear that we have Tt = Tt+n - In general, let (Tj)o<j<oo 
be a sequence of Markov times with 

(5) To = 0, Tj:=ra\n{t:Nt:={b-l)j + l], Nt = \dTtl 

where Nt is the number of external nodes or leaves in the tree Tj. Con- 
sequently, Tj is the time of the jth death. By convention, we consider all 
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internal nodes at time t deceased and the remaining external nodes alive. 
There are k internal nodes if and only if there are exactly (b — l)k + l external 
ones. 

Since ri = Exp(l), we obtain 

Tk - = min{£;i, . . . , = Exp((6 - 1)(A; - 1) + 1), 

where (Ej) are identically distributed, independent random variables, dis- 
tributed as E and representing the remaining lifetimes of the {b—l)(k — l) + l 
nodes alive at time Tk-i after the {k — l)th death. For the distribution of Tk 
we obtain then 

E, 



(6) n^Y. 



-l)(j-l) + l- 

General notation. We denote for functions /, (7 : N i— > M where g{n) ^ 
V?i G N, 

/~a.s.5 ^ lim44 = l ^ /(?i) = <7(n)(l+o(l)) 
rn>oo g[n) 

with 0(1)— 7'0,n^oo. In the same manner we define 

f(n) 

/<a.s.5 ^ lim:^<l 4^ /(?z)<5(n)(l+o(l)) 
n-5>oo g[n) 

with 0(1) — )• 0, n — 00. 
We let No :=NU{0}. 

We summarize the results in the following lemma which is the result of 
Chauvin et al. [(2005), Lemma 2.1] in the Yule tree case. 



Lemma 2.2. (1) We have — Tk-i)k>i independent and 
rfc-rfc_i = Exp((6-l)(A:-l) + l). 

(2) Further, {Tk)k>i o,nd {TT-f.)k>i are independent. 

(3) We have {h — l)r„ ~a.s. logn almost surely. 



Proof. The first two statements can be proven with the same argu- 
ments given in Chauvin et al. (2005). The last assertion is Proposition 1 in 
Broutin and Devroye (2006). □ 

Exactly in the same manner as for the tree Too, we assign to each node 
in the tree Tj a label u £U in the natural way [cf. (1)]. Additionally, we 
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weighted the edges in the tree {Tt)t>o as the values induced by Too- We 
define for n = ui • • • u„ G ZY, 

k=l 

as its weight or weighted position and accordingly for I G Z,'^ and t > 0, 

pt{l) :=\{uedTt:Du = l}\. 
Note that we obtain for t > 0, dTt+n is equal to dTt. 



3. Continuous time martingales. Our ambition is the study of the profile 
of a class of discrete time trees {Tk)k that can be constructed from the class 
of continuous time trees considered in Section 2 [cf . Section 7] . We will follow 
the ideas used by Chauvin et al. (2005) in the case of the random binary 
search tree and the Yule tree process and construct trees {Tt)t>o described 

in the last section with T^^ = 7fc. Alternatively, we deal with trees 7k '■= T^f. 
whose profiles are at least comparable to the profile of Tfc. Recall that if 
we stop the tree process Tt at time t = t^, we have k internal nodes and 
(b — l)k + 1 external nodes or leaves in the tree. 

These general trees are not as well studied as the binary search tree and 
its continuous time analog, the Yule tree process. Chauvin et al. (2005) 
used classical results from the Yule tree process and a corresponding frag- 
mentation process [cf. Chauvin et al. (2005), Section 2.2] to formulate their 
main profile convergence result (see their Theorem 4.1). In general, it seems 
difficult to use a connection with a fragmentation processes. 

Note that in the following we consider the natural filtrations {J-t)t>o and 
(-^(fc))fceNo where 

Tt ■■= cr{Ts,s < t) and T(^k) ■= (^{Tt, , 1 < j < ^)- 

For V = {vi, . . . , Vd) and w = {wi , . . . , Wd) G C'' we define 

d 

V + w = {vi + wi, . . . ,Vd + Wd) and v ■ w := VjWj . 

i=i 

Set 

(7) zW(dx):= ~6{D^}{dx), 

u&dTt 

where 5 denotes the Dirac measure. 
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Let m(A)* := E f e~^'^ Z'^^\dx) for t > 0. Tlien, for |m(A)| < oo we define 
the classical martingale W^^\X) for A G C^, t>0, as 

(8) T^W(A):=^|e"^-zW(dx). 

Using Biggins [(1992), page 148], it follows for t > that 

m{XY = exp[t{bEe-^-^ - 1)]. 
For the special case d = 1 we define, setting z : = e'^ G C, 

(9) M{t,z):=W^'\-X) = W^'\-logz)= ^D^^-tibEz^-i) _ 

uGdTt 

This continuous time martingale was studied for Z = 1 and b = 2 (the Yule 
tree case) in Chauvin et al. (2005). 

4. Discrete time martingales. Let Tn ■= T^^ be the discrete time edge- 
weighted tree with n internal nodes. We define for n G N and A G C'' \ {A : 
C„(A) = 0), 

where Cn(A) is a multiplicative factor which we will specify below, and Ui{n) 
is the number of external nodes in Tn at (weighted) level / G Z'^; 

(11) Ui{n):=pr^{l) = \{u^dTr^:Du = l}\, / G n G N. 

We choose C„(A) in order to make Wn{X) a martingale with respect to the 
filtration = 0"{7j, 1 < j < n}. In the case d = l,b = 2 and Z = 1, the 
random binary tree case, the martingale 

(12) Mn{z):=Wn{-log{z)), 

where z G C was found and Cn{— log z) calculated by Jabbour-Hattab (2001). 

For the general case, let Dn be the weighted depth of the nth inserted (in- 
ternal) node in (7^)m6No respectively {Tt)t£M.+ - Because of the exponential 
distribution of the lifetimes, each alive individual (external node) is equally 
likely to be the next one to die and to become an internal node. Therefore 
we obtain for I G Z'^ 

Assume now that node u G T^-^ is the next node to expire and to bear b new 

individuals in the tree Tr^^-^^. Denote by Z^p~^^\ I < j <b, the d-dimensional 
weight assigned to the edge from node u to its jth child. Then we have 



(13) Ui{n + 1) - U,{n) = -l|z5„^,=o + 1^^^^^^^ 
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With Wn{X) := YlieZ'i Ui{n) exp(— A • /) and Wq{\) := 1, we obtain from (13) 

I 

Ui{n) 



I 

b 



l)n + l 



Wn{X)- 



(6- l)n + 1 
Iterating this we have that 



n •=d((^ - i)j + bEe^^-^)/{{b - i)j + 1) ; „e 



is a (^(n))n6N adapted martingale. Consequently, we set it according to our 
notation for n > 1 

and Co (A) := 1. We define for further calculations and, for reference, the 
following set: 

Nc := {Xe&: Cn{X) = for some n} 
(15) . ^ 

A : -j—^Eexp{-X • Z) G I^o 

5. Relationships between discrete and continuous time martingales. In 

this section we will study the relationship of the two martingales {Wn{X))n£No 
and (VFW(A)) t>o defined in Sections 3 and 4. With C„(A),n G No, defined 
in (14), we set for A G \ iVc [cf. (15)], 

nniX):=Cn{X)e^-^'-'''''-"'\ n>0. 
We claim that (^n(A)) 

ngMo is a martingale with respect to the filtration 
(Jv„)n6No with expectation 1. 

Lemma 5.1. Let XeC^XNc- Then: 
(1) W^--\X)=nn{X)Wn{X), nG No. 
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(2) ('Hn(A))ngNo 'is a martingale with respect to the filtration (-^T„)neNo 
with expectation 1. 

(3) {HnW)nmo ffl"-^ {Wn{X))n&No independent. 

Proof. The first statement is a direct consequence of tlie definition. 
Tlie otfier two statements follow from an application of Lemma 2.2. □ 

5.1. Asymptotic behavior and further relationships. In this subsection we 
are interested in the asymptotic behavior of Cn{0) for 6 G M*^. 

Lemma 5.2. For G W^, b>l, and n—?-oowe have 
n^/(b-i)(bEe~^'''-i) r(l/(b-l)) 

Proof. Let a ^ and /? G M. From Stirling's formula for the Gamma 
function [see, e.g., Flajolet and Odlyzko (1990)], we obtain 



aj + (3 _ f n-{-f3/a) - 1 



n-l 

n ai + 1 V n 



n- i-l/a) - 1 



n 



-1 



With a = (6 — 1) and /? = bEe ^'^ we obtain the statement. □ 

Since ^n(A) is a positive martingale for Ee^'^'^ > 0, we obtain immedi- 
ately from a well-known, classical martingale result, for every A G ffi'^, ^n(A) 
converges almost surely to a limit 'H(A) as n — )• oo. More details are given 
in the next lemma. 

Lemma 5.3. Let A G M'^ \ Nc with Nc defined in (15). Then we have, 
almost surely, 

/ Y r(l/(6-l)) 

^"(^)^V^J n{bEe^^-^)/{b-l)y 
whereY = T{-^,-^). 

Proof. First we show that e^^^^^^^ Nt, defined in (5), converges almost 
surely and that the limit law is gamma distributed. From Athreya and Ney 
[(1972), Remark 1, page 109] we detect that 



Es^' = se-\l - {1 - e~ 
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As a consequence of setting a :=b — 1> and u = —is for s G M, we obtain 
by standard calculations 

(16) i?e^^^*^"" ^ ( CO. 
^ ' \ms + lj 

The right-hand side of (16) is the characteristic function of a r(l/a, 1/a) 
distributed random variable with density 

With Doob's limit law it is immediately verified that the continuous param- 
eter nonnegative martingale {Nte~^^~^^^)t>o converges with probability one 
to a finite limit Y which is gamma distributed with parameters (^, ^): 

(17) iVte~(''-^)* ^ y, CO, a.s. 
Consequently, we have for n — t- oo, 

(18) (6- l)ne^(''^^)^" a.s., 

since t„ converges toward infinity almost surely. Therefore, we obtain almost 
surely with Lemma 5.2 that for n — )• oo 

and AG]R^\iVc, 
Hn[A) a.s.in e ) T{{bEe-^-^)/{b - 1)) 



Y y^^e-^-D/cfe-i) r(i/(6-i)) 



b-lj r{ibEe-^-^)/{b-l))' 



□ 



Note that for defined in (7), we have 

Nt 

zW(cix) = J]l^^(„^(dx), 

r=l 

where Zr*^ is the weighted position of the rth individual alive at time t. If 
the rth individual is equal to node u, then zi^^ is equal to G Z'^. Note 
that the positions of the individuals alive just before — the time of the 
nth death — are {4^""'^ : 1 < r < (6 - l)(n - 1) + 1} as the particles do not 
move during their lifetimes. 

Before we state the next theorem, we formulate the following preliminary 
proposition. The following set will play an essential role in the sequel (cf. 
Theorem 5.5). For 1< 7 < 2, A = 6^ + i??, define C by 

n\ :=int{A:6^e^^^-^ <oo}, 
a^=int|A:^^<l 



7 •" 



m(A)|' 
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We set 
(19) 



A:= IJ nl^na 

1<7<2 



Proposition 5.4. For 9eA,5>0,n>l,aeM.we have: 
(1) SH^(^")(6') = (^?T^((9)-^")^^■ 



Ar„) 



(2) Ee - — Em{e)-^^{{b-~l)n+l)' 

(3) r((iog(V^)^"+i)/(^-i)) ^- iog(i/^) V(fe-i) . 

Proof. For the proof of the first statement we let n > 1, setting 
l)n + l, 



In particular, we designate = log?Ti(0) + 1. Using induction over n, we find 



a,n-i 



(t„-i) 



n 



r=2 j=l 

(6-l)(i-l) + l+logm(g) 
(6-l)(i-l) + l 



+z,) 



Using (6) we obtain 



E, 



(20) 



n 



^^-(6-i)(j-i)+ 

(6-l)(j-l) + l 



L(6-l)(j-l) + l-log,5«- 



From this we can finally conclude that ^„ = ^n(^) = {Em{6) '^") 
For the second statement we obtain immediately from (1) that 

i^e-^-r-' =cn= —in = —{Em{er^-r\ 

ttfl (Ijl 

Finally for the last assertion, using the same asymptotic result as in 
Lemma 5.2, we obtain 



" -11 

TT gj + 1 

aj + 1 + X 



(x + 1 



r((x + l)/fl) 
r(l/a) 



(n+1)-^/^ 
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and further, using (20), 

r((iog(i/jr + i)/(6-i)) 
> r(i/(6-i)) ■ □ 

In the fohowing theorem we describe the convergence behavior of the 
discrete time martingale of Section 4 [see (10)] and of the continuous time 
martingale of Section 3 [cf. (8)]. The first part is based on Biggins (1992), 
Theorem 6. The proof of the second part is based on the application of the 
first part of Theorem 5.5 and the relationship between the two martingales 
(cf. Lemma 5.1). 

For a set AcC^ let int(A) denote the set of interior points of A. 
Theorem 5.5. With A defined in (19) we have: 

(1) ast^oo, {VFW(A)} converges, a.s. and in , uniformly on every 
compact subset C of A; 

(2) as n — 7- oo, {W„(A)} converges, a.s. and in , uniformly on every 
compact subset C of A. 

The limits are denoted as W^°°\X), respectively, Woo (A). 

Proof. The first part was proven by Biggins [(1992), Theorem 6], so 
we only need to prove the second part. 

Let C C A be a compact subset of A. Therefore, 

lim sup S sup I W„( A) -Wat (A) I =0 

^ n>N AeC 

which implies the uniform convergence. Additionally, due to the fact that 
(sup;^gf; |W^n(A) — Wj\[{X)\)n>N is a submartiugale, this implies also the a.s. 
uniform convergence. From Lemma 5.1 we have 

Wn{X) - Wn{\) = E{W^--\\) - W^(^^)(A)|J-(„)). 

Now taking the supremum and expectations we further deduce 

EsvLp\Wn{X)-WN{X)\ <e(sup\W^^"\X) -W'-^''\X)\). 

Taking the supremum over n> N we get 

sup -Esup |Wn(A) -Wn{X)\<E sup f sup|T^(""")(A) - T^(^^)(A)| 

<EAn, 

where we set 

A„:= sup (sup|P^(^)(A) -H^(""")(A)|). 
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Since W^^\X) converges a.s. uniformly, we have a.s. that Hm„_j.oo A„ = 0. 
With the triangle inequality we obtain that A„ < 2Aq. If we show that Aq 
is integrable, we deduce lim„ EAn = and the statement of the theorem 
using dominated convergence. 

Let Mx{s) ■.= {s>0:W^'\X) ^W^'~\X)} for A G A and s > 0. As it is 
shown in Bertoin and Rouault [(2003), proof of Proposition 3] it is sufficient 
for proving the integrability of Aq to show that for all j; G A a disk, 

Z),(p):={AgC'^:||A-x|| <p}, 



exists with 



sup e{ y \W^"\>^)-W^'-\X)\'') < oo 



for some q G (1,2]. Note that the set Mx{x) is a.s. countable. We observe 
that W^^^lx) / V^(*")(A) if and only if t = r„ for some n G N, and for that 
reason we can conclude that 



y \W^'\X)-W^'"\X)\''^ 



£;^|Ty(--)(A)-VF(-"")(A)|^ 



n=l 



n=l 



?n(A) ^" 



Now set ttn-i '■= (6 — l)(n — 1) + 1 for n G N. Then we derive the following 
formula for Z^'^"^: 



(21) 



■r=2 ■ - j=l 



where (Zi, . . . , Z^) are independent of {z. 
as {Zt\...,zi'^) [cf. (2)]. 



1 < r < fln-i} and distributed 



Then, with Lemma 2.2, Jensen's inequality, independence and Proposition 
5.4, we have for q G (1, 2], := o"(rj, 1 < j < n) and n G Nq, that 



E\W^^^\X)-W^^"-\X)\' 



E \ |m(A)r«^"^ 



(^n-l) 



(22) 



/ b 

E 

\j=i 
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ttn-i Em{9q) 
Recall the definition of A as 

A= U (l7^no2). 

1<7<2 

For X S 0^ n 0^ for some 7 G (1, 2], we can choose p sufficiently small so that 
Dp{x) C n 0^. Then there exist some 5 < 1 so that for all A S Dp{x) we 

have ™|^^T\-)|'^ ^ ^ for some 7 G (1,2]. Using this we obtain with Proposition 
5.4, 

E\m{X)\-''^'^{Em{ejy^"-')-'^ <a.s. C{j, X,b)n^°^^'' /^''-'^l 
With (22) by setting g = 7 G (1, 2] we obtain 
E\W'^^"\X)-W^^"-\X)\^ 



<a.s. 2^-1^ (^E (^X: e-^'-^y + 1^ C(7, A, 6)ni°s^V(^-i) 
s.(7(7,A,%-(i+^) 



for a suitable constant (7(7, A, 6) independent of n, and e := ^^^^g^f^^- Since 
^^n""*^"^ < 00 and sup;^g£)_(-^.-) (7(7, A, 6) < 00 for p sufficiently small, we 
obtain the statement. □ 

Remark 5.6. The proof of the second part of Theorem 5.5 is roughly 
the same as the corresponding proof of Chauvin et al. [(2005), Theorem 3.1] 
up to the point where the integrability of Aq is proven. Their arguments 
can also be used in our more general case except of the one concerning 
the integrability of Aq = sup7i>oSup;^gc' |1^*^*^(A) — 1|. Since the Yule tree 
process is a special kind of a fragmentation process, the integrability of Aq 
in Chauvin et al. (2005) can be obtained from a result of Bertoin and Rouault 
(2005). To be more precise for fragmentation processes, the integrability can 
be verified by an application of the compensation formula for Poisson point 
processes applied to the Poissonian construction of the fragmentation (see 
Bertoin and Rouault [(2003), Proposition 3] and Bertoin [(2003), Theorem 
2] for more details) . Is seems difficult to use a similar argument in the general 
case. 



16 



E.-M. SCHOPP 



Remark 5.7. Recall that we have already defined 



A= J {n]na 

with 



2^ 

1<7<2 



0^ = int{A : bEe-"'^-^ < 00} and 
= int<^ A:-^ — )4tt- < 1 



|m(A)|' 

in (19). Using Biggins [(1992), Theorem 6], we have that {W^W(A)} converges 
uniformly on any compact subset of A, almost surely and in mean, as i — >■ 00 
[cf. Theorem 5.5]. 
Now define 

A:= AnM'' 

as the restriction of A on Mf^. Then we can rewrite A and show that 

(23) A= y n\nn^ 

1<7<2 

with 

■.= leeR'^:eeno:- logm(g) < " ""'/^^ 1 and 

I m{e) J 

:= int{A G & : m(Re(A)) < 00} 
(cf. Biggins [(1992), page 141]). Since 

m{e) = exp(6^e"^-^ - 1) and 
m'{e) =m{e){-bEZe^^-^), 
we have for any ^ € Oq ^ M^, 

9£n^ ^ <b9- EZe^^-^. 



5.1.1. Subtree sizes. The rest of Section 5 is devoted to a further char- 
acterization of W^°°\-), resp. Woo(-)5 defined in Theorem 5.5 as solutions of 
fixed point equations. For deriving these fixed point equations we will split 
the original tree in the b subtrees which are growing from the children of 
the root. For that purpose we will investigate the sizes of the subtrees which 
are growing from a node in the tree. For every u^U let 

r(") := inf{t : n G 
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be the time of the first appearance (birth) of node u in the tree. For t > 
set 

r/") :={veU:uveT,^,,uj}, 
the tree process growing from u. Further set 

Nt^ = \dTt^\ and := N^^l^, , 

the number of leaves at time t > in the subtree growing from node u. 
Then, using the same arguments as in (17), we obtain 

(24) lim e-(''-i)*Afi^"^ = Yu and 



t—>oo 



-t{b-l)(u)_^ r(-)(b-l) 



(25) hm e-'^'-'^n^' = ^^e""^ 

t—^oo 

where Yu is distributed as Y; that is, it is r(^^,j^) distributed. If u,v 
are not in the same fine of descent, we can conclude that by the branching 
property Y^ and Yy are independent. Since for t > r*-"-* we have 

nj") = nj"^) + nj"') + • • • + n^"'^ , r (^i) = r ("2) = . . . = ^ ("ft) , 

we obtain using (25) 

and for that reason we further get 

-I 7/. 



(26) ^ ^ T^T^ - 1<J<^- 



(n) ■ y^fe y . 



Finally, with ti = t^^\ we have 

(27) y:=y, = e-^i(''-i)(yi + ... + n), 

(28) = [/(j)ye^i(*-^\ 1 < i < 6 and 

(29) l = ^[/(i). 

The distribution of the subtree sizes and their limit distributions can now 
be further calculated using a generalized Polya-Eggenberger urn model. 
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5.1.2. Limit martingale equation and splitting formulas. In the following 
statement we derive representations of W^°°\-) and VFoo(') as solutions of 
fixed point equations. Furthermore, the first part of Theorem 5.8 emphasizes 
the close relationship of W^°°\-) and Woo(-)- 

Theorem 5.8. Let us assume A G A [of. (23)]. Then the following for- 
mulas hold: 



(1) limit martingale connection. 



{bEe-^-^-l)/{b~l) 



(30) 

r((5^1'-i'v(i-i)) ^-^^^ 

withY^V{j^,,^)- 
(2) splitting formula, 

(a) for the continuous time process 

b 

(31) W^^\\) = ^e-^-^^e^^i(^^^"''^^^%j~)(A), 

i=i 

where w\^\\) , . . . ,W^^\\) are independent, distributed as W'^°°\\) and 
independent of ti and {Zi,...,Z{)) where {Zi,...,Zh) are the weights as- 
signed to the edge from node to its children 1, . . . ,b €U , 

(b) for the discrete time process, 

(32) Tyoo(A) = ^e-^-^^([/(^-))(^^^-^"-^)/(^"^)t^o.,(,)(A), 

i=i 

where VFoo,(i)('^)' • • • ' ^oo,(fe)('^) '^'"c independent, distributed as Woo(A) and 
independent of (U^^^) where (U^^^) are defined in (26). 

Proof. (1) This is a consequence of Lemmas 5.1 and 5.3. 
(2) (a) For t> ti we have 

t^W(A) = ^e-^-^^e--^(''^^"'-^-i)H^j*-^^)(A), 
i=i 

where for j = 1 , . . . , 6 we set 

Wf\\):= ^-X-D^^-tibEe-^-^-l)^ 
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and we let {Zi, . . . , Zb) be the weights assigned to the edges from to its 
children 1, . . . , 6. Now let t ^ oo. Then the assertion follows. 

(b) Follows now from (1), (2)(a) and (28). □ 

6. Profiles and large deviation results. To prove our main theorem, The- 
orem 6.6, we will need some preliminary lemmas that will be stated at the 
beginning of this section. The first one. Lemma 6.2, is based on Biggins 
[(1992), Lemma 5] in the discrete time case with nonlattice weights Z. The 
proof for the continuous time nonlattice version can be managed with some 
additional arguments [see Biggins (1992), page 150]. For the lattice case note 
that the critical points in the proof of Lemma 6.2 are the values r/ G M'^ with 

= exp(6Se^^-^(cos(?7 • Z) - 1)) = 1, Oe J^q. 

This can only occur if Z is a lattice distribution or if 77 = 0. Let 

(33) N ■.= {leZ'^:P{{Z = l})>0} 

denote the support of Z. For r/ = (r/i, . . . , rj^) G M"' we denote by 

|r/| :=max{\r]j\,j = l,...,d} 
the maximum- norm. Denote for vi, . . . ,Vr G^"^ hy 

£{vi,. ..,Vr}:=f^v:3X = {Xi,..., A,.) G w = ^ XjVj | C M'^ 

the subspace generated from the vectors vi,. . . ,Vr and, analogously, let for 
some subset S C M"^, £{S) be the subspace in generated from the vectors 
in the set S. The next proposition is obvious and will be stated for further 
reference in the paper. 

Proposition 6.1. Let Z be a lattice distribution with support N ; define 
feiv) ■■= bEe-^-^ {cos{i] • Z) - 1), ^ G 17o, r/ G R"^ 
and let a G M}^. 

(1) // the dimension of £{N) is equal to d, then there are only finitely 
many roots rjo = 0,rii, ... ,r]m., of fe{-) in [— 0,0]*^ independently for all9^Q.Q 
where m = m{a) G No- 

(2) // the dimension of £{N) is r < d, then there are ??o,i, . . . , ??o,d-r G 
(linearly) independent vectors, % = 0, r/i, . . . , r/m G [—a,a]'^ where m := 

m{a) G No so that 

m 

L:={r,e[-a,a]'':fe{v)=0}c\JC„ 

j=0 



m(A) 
m{e) 
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where 



< J < m. 



Lemma 6.2. For every lattice- distribution Z we have almost surely 



(34) lim sup 



t—>oo 



Vf\W'-'\e + iri)-W^°°^ 



\<TT 



m{6 + irj) 



m{e) 



dr] = 



for every compact subset K C A. 



Proof. Let C and K represent here and in the rest of the proof some 
arbitrary generic constants in whose values may differ. Let fei-) be 
defined as in Proposition 6.1. 

Using the notation of Proposition 6.1, we will first assume that the di- 
mension of £{N) is equal to d. As in Biggins (1992) we divide the integral 
into two parts for \r]\ < e and £ <\rj\ < vr. 

We will consider the case |r/| < e first. With the standard Taylor series 
estimation we have for small e and \rj\ < e with feiO) = /g(0) = 0, 

m(A) 



(35) 



sup 



<exp{—C\r]\ ) 



m{e) 

for some constant C > 0. Now let 

Ke := {A : Re(A) eET, I Im(A) I <e}. 
Then, with Theorem 5.5, we have 

m(A) * 



sup 



m{e) 



< sup \W^^\X) - W^°°^ 



\n\<eVi 



drj 

exp(— C|r/p) drj 



i — >• oo. 

The last expression can be made arbitrarily small by choosing e sufficiently 
small. 

Next, we consider 



sup 

e6_fs:-^e<l»)l<i- 



+ ir])-W^°°\e)\ 



m{6 + irf) 



m{e) 



drj. 



Let {r/i, . . . , r/„^(7r)} be the roots of feiri) or equivalently the values for which 
l^^^^py^l = 1, € r^o a-iid < I??! < vr. This is the case if and only if 77 • / G 
27rZ for all I & N and < Ir^j < vr. Consequently, we have cos{rjj ■ Z) = 1, 
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sm{r]j ■ Z) = 0, e ^^^'^ = 1 and m{9) = m{9 + ir]j) a.s. for j = 1, . . . ,m 
m(7r). With the triangle inequahty we obtain 
_ 



sup 



< 



Vi \W^'\9 + i7])-W^'=^\9)\ 



m{9 + irj) 



m{9) 



df] 



sup 

eeK J\v-Vj\<£ 



Vt'^\W^'\9 + i7])-W'''\9)\ 



m{9 + ir]) 



+ sup 

eeK J\v-Vj\<£ 



Vi'^\w^'\e)-w'^°°\e)\ 



m{9) 
m[6 + irj) 



m{9) 



dr] 



dr] 



For the second integral, Ig''^ let Ke^^ := {A = 9 + ir] e -.9 e K ,\r]-r]j\ < e}. 
Again with a Taylor series estimation we have for e small and |?? — ?7j| < e, 

m(A) 



sup 



m{9) 

Then, for every e > we have 

sup / ^^t^\W^'\9)-W'^'^\9)\ 
e&KJ\v~Vi\<s 

< sup \W^^\9)-W^'^\9)\ 



< exp(— C|r/ — r]j\'^ 
m{9 + irj) * 



m{9) 



dr] 



exp(— Ci|ry — r^jl )dri 



\V-V]\<£ 

■ sup \W^'\9)-W^°°\9)\ 



|r?|<ev^ 



exp(-C|r/p)dr/^0, 



t — )• oo. 



-^(7r/C)'*/2<oo,t-5.oo 

by Theorem 5.5. 

Next, we claim that W^*\9 + irij) = W^''\9) almost surely for j = 1, . . . ,m. 
For the proof note that W^^\9 + irjj) = W^^\9) because of rjj ■ Zg G 27rZ a.s. 
for e G Tr{u),u G dTf. In particular we have also W^°°\9 + irjj) = W^°°\9) 
for j = 1, . . . ,m. Then, calculating the first integral, using (35), 

m{9 + irj) * 



sup 



Vi \W^'\9 + iri)-W^'\9)\ 



drj 



< 



sup 



\W^'\X) - W^'\9)\ 



X&K, 



(i) 



|r?|<£v^ 



m{9) 
exp(— C|?/p) dr] 
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with Theorem 5.5 letting t — )■ oo. This can be made arbitrarily small by 
letting e \ and, therefore, rj^ r]j. 

Next we show that for B := {rj : e < \r]\ < vr, \rj — rjj\ > e,j = 1, . . . ,m}, we 
obtain 



B 



mix) 



m{e) 



t 



■ oo, 



uniformly in a neighbourhood of any 6o G A and hence uniformly on K. The 
convergence of 

t 



B 



m(A) 



m{9) 



toward zero will immediately follow from these considerations, completing 
the proof. 

Now we have B = IJje/i^ • — 1^1 — ^ii — \v\ — ^} for some finite 
index set / and suitable aj < bj. Additionally, > 1 so that for all j € I 
we have fg{7]) / if 6^ G and j^aj < < fibj. For Oq £ A let Bp := {9 G 

R<^:\9- 9o\ < p} and let g'-J^ ■= {X:9 e Bcp.c-^aj < \rj\ < cbj}, j G /. It 



follows that g'-J^p C G)^Jp for < ci < C2, j G /. As 6*0 G A, there is 7 G (1, 2] 
so that 9o GQ'i. Then as a conclusion from the definition of Q.'i we have 



.(i) 



m(0o) 



< 1. 



Therefore, we can choose p sufficiently small, so that for some 5 < 1 we have 
-B^p C , and also 



snp{m{j9)^/^:9eBpp} 
mf{m{9):9eBpp} 



<6. 



Further for 9 gQq we have | \ < 1 for all j^aj < \r]\ < pbj and j G /. We 



I in{e) 

obtain that for r sufficiently small 

sup{|m(A)|:AGGg} 
inf{m(6l) : 9 G B^r} 

Let SW(A) :=/e-^-^ZW(dx) -m(A)*; then we obtain 
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Let In := {t:n < t < n + 1}. Then for j G / we find 



sup 



(36) 



aj<\v\<bj 



m(A) 



m{e) 



drj 



< Ky/n + 1 



.rf/sup{|i?W(A)|:AGG^-'\tG/„} 



\ inf{?n(6l)"+i -.OeBp} 

Now since B^^\\) is an analytic function, we can use Cauchy's integral 
formula, the triangle inequality [see Biggins (1992), Lemma 3], and a com- 
pactness argument to show that 



(37) 



sup |i?W(A)|<i^ [ \B^'\zis))\ds, jGl, 



where {C~!^^} parameterize the distinguished boundaries of a finite number 

of disks covering G^p and lying within for j G /. As in Biggins (1992) 
we take expected values of (37) and use Jensen's inequality for some a> 1 
to obtain that for j £ I, 

l/a 



(38) 



E sup |bW(A)| sup (^sup|sW(A)|" 



tein 



for some constant K. Note that m(A) is never zero and that |-B(*)(A)/?tt-(A)*| 
is a regular submartingale. Hence with a standard martingale inequality 
[Williams (1979), Lemma 43.3], we obtain 



E sup 



BW(A) 



m{\y 



< 



a 



a ■ 



1 



E 



5(n+l)(A) 



n+l 



and so the right-hand side of (38) is less than 



(39) 



K sup (E|5("+^)(A)r)^/", 



Now the proof continues exactly as in Biggins [(1992), Lemma 5], bounding 
(39) [cf. Biggins (1992), (4.6)] with Biggins [(1992), Lemma 6] and finally 
shows that the expected value of (36) converges toward zero, and thus (36) 
converges toward zero almost surely, if n — ?• oo. 

For the second case when the dimension of £{N) is r < d we choose a = vr 
in Proposition 6.1 and define 



where 



Dj ■.= Dj{e,7r) := {r? : |r?| < vr, d(r/, Cj) < e}, 
d(?/, Cj) := min{|r7 — w|, w G Cj} 
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for < j < m = m(7r) (Cj was defined in the second part of Proposition 6.1). 
Note that we can choose > so small that we have m(7r) = m(7r + 5). Set 



9e 



m{6 + irf) 



m{e) 



We have | "^f^g^^ | < 1 for ^ G S = 5(e, vr) := [-vr, t^Y \ UJLo , and 

m 

B{e, vr) C [-^ - <5, vr + 6f\^ [j Dj{e/2, tt + 6) =: B{e/2, tt + 6). 

j=0 

We can now use similar arguments as in the case r = d to obtain that uni- 
formly in a neighborhood of any 9q € A, and hence uniformly on K we have 



Jb 



Let Dj := {A G R'^-'-:r]j + Efc=i ^fc%,fc e [-vr, vr]'^}, < j < m, with 
(%,fc)i<fc<d-r as in Proposition 6.1. Then we conclude that 

m „ m „ „ 

ge{vf^dv<Y. / ^ ge{r,fUr,dX. 

With the same calculations as in the case r = d, we get for every p > 0, 

?(r7)Wdr7<p 



\v-{v,+T.t2l^kV0,k)\<e 

if e < e(p) and t > T[p) independently of the choice of A and 9. Therefore, 
for some Cg (0,oo) we have 

m „ m . 

/ 9e{vf^dr^<Y / pdX = Cp, 
j=Q JxeDj 

and this can be made arbitrarily small by choosing p sufficiently small. This 
completes the proof. □ 

Lemma 6.3. For any compact set C C A we have almost surely 
lim sup e-'^-^e'^^-^^^~'''''h'^/^[pt{l) - W^'^'^ {e)G(^^){e)] = 0, 



with 

e (2-kY J\n\<n 
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Proof. Generally we have W^*\e) = e*(i-^^e"'' '') pt{l)e^^-^ [cf. (8)] 
and 

IttJ 7| |<^ [1, if 1 = 1. 



Using this and Lemma 6.2, we obtain 

where the error term o(l) is uniform in / and in 9 in any compact subset of 
A. □ 

Remark 6.4. The next corollary deals with the special case when d = l 
and when Z takes only finitely many values in Nq. In particular, Corollary 
6.5 gives detailed information about the term Gi,t{0) defined in Lemma 6.3. 
Note that for the proof of Corollary 6.5 we use the Cauchy formula to obtain 

Finally, standard calculations such as those used in Lemma 6.3 lead then to 
the assertion of Corollary 6.5. 

Corollary 6.5. Let d = I and N C {0,1, ... ,L} for some L £ 'Nq. 
Then, almost surely, for any compact C C A, we have 



hm sup e-^^e*(i-^^^"''')Vt 



p,{l)-W^^\9)e-'^ 







with 



Di 3 = 1 



where Di := {(oi, . . . ,0^) G ^o-J2j=ij"-j = 0- 

The terms in Corollary 6.5 can be further calculated. Let pj := P{Z = j) 

for j G No and V^^^l) := the Poisson measure with parameter ^ > 0. 

Then 



L 
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J2jaj=lj=l 



With the local limit theorem in Petrov [(1975), Theorem 7 of Chapter VII, 
Section 2], 

{i-xr 



lim sup 

A— >oo ; 



exp 



2A 



0, 



we have 



(40) 



ViW {oo,-log{z)) 



E n 



X exp 



[ttj — tbz^pj) 
2tbz^Pj 



+ o(l) +0(1) 



For this reason, using Corollary 6.5, we obtain an expression which can be 
easily calculated for Ai,l> 1 in the binary search tree case [Z = 1 a.s. and 
Ai = (2t)'] [cf. Chauvin et al. (2005)], or also for the random recursive tree 
case [P{Z = 0) = P{Z = 1) = ^ and Ai = eH^]. 

Chauvin et al. (2005) calculated and estimated the expression in (40) to 
prove their convergence result. To prove our main theorem, Theorem 6.6, 
we chose a calculation inspired by a proof of Uchiyama (1982). 



Note that for g : ] 



I-)- 



we let Dg{x) = g'{x) denote its gradient and 



D g{x) its Hessian matrix at point x G M , if such exists. Further, for M G 



we denote by det M its determinant. Let 



(41) Ai- 
with 

(42) 



bEe- 



9-Z 



l = E I e-'^-^Xidx)-! 



where {Zi, . . . , Zh) = {z[^\ . . . , Z^'^') are distributed as the random weights 
attached to the edges that connect the root with its children l,...,b. To 
prove our main Theorem 6.6 we assume that X is nondegenerate in the 



for e^n, 
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sense that the support of the intensity measure of X is not contained in any 
d—1 dimensional hyperplane of M*^. Then D^A{—9) is a positive definite for 
all 9 E Oq, by the inverse mapping theorem the set := {DA{—9) : 9 £ Oq} 
is open, and the mapping 

9^c = DA{-9) 
is a homeomorphism of 0,q onto 0,q. 

Theorem 6.6. Let K he a compact subset of A* := {DA{-9) :9 e A} C 
and assume that X defined in (42) is nondegenerate. Then, almost surely 



lim sup 



ln{c) 



Ac{n) 



Woo{9{c)) 



where 9{c) G A is chosen so that for c =: (ci, . . . , G M'^ we have 



and 



DA{-9{c)) = bEZe-^ 
ln{c) :-- 



(cyz . 



c, 



clogn 




c\ log n 




Crflogn 






b-l 




b-l 



n 



(6£;e-»('=)-^-l)/(b-l) 



e-s{c)Mc)^(2Trlogn/{b-l)ydetD^A{-9{c)) 
r(l/(6-l)) 



r((6^e-^W-^)/(6-l))' 
For the limit Woo(-) we have 

(43) W^{9) = ^e-^-^^(^(^-))(^^^-"""^)/(''-^)Ty^,(,)(e), 

i=i 

where W^^(^i){9), . . . ,W^^(^i^-^{9) are independent, distributed as Woo(^) and 
independent of (U^^^) that are defined in (26). 

Remark 6.7. Prom Theorem 5.5 it follows immediately from uniform 
convergence on compact subsets that (VFoo(A),A G A) is a random analytic 
function. Furthermore, we have for ^ G A*, Woa{9) is the unique solution 
of the fixed point equation (43) with expectation one. For this result, note 
that for G A*, (Wn{9))n is a nonnegative martingale with an (absolute) first 
moment equal to one. From this we can conclude that EWoo{9) = 1 for all 
G A* (e.g., with Doob's limit law). Finally, using the result of Caliebe and 
Rosier (2004) we ascertain that the solution of the fixed point equation (43) 
with a finite nonzero expectation is unique up to a multiplicative constant. 
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We can also reformulate Theorem 6.6 in terms of / instead of in terms of 

c: 

Corollary 6.8. Let K he a compact subset of A* := {DA{-9):0 G A}. 
Assume that X defined in (42) is nondegenerate. Then, almost surely 

Ui{n) 



lim sup 

"^°°«;(6-l)/logn«eK 

where 6in^ K is chosen so that 



^(fe-l)«/logn(^) 



0, 



bEZe-'''"-^=^4^l^ 
logn 



and 



{bEe'"l-^"'^-l)/{b~l) 



X 



y/{2TTlogn/{b - l)YdetD^Ai-ei^n) 

r(i/(6-i)) 



Further, for Woo{-) we have 

where VFoo,(i)(^), • • • , VFoo,(b)(6') are independent, distributed as Woo (6*) and 
independent of (U^^^) where (U^^^) are defined in (26). 

Remark 6.9. Note that the following two procedures are equivalent: 

(1) take the supremum over c G C A* with K a compact subset and 
then choose 0{c), or 

(2) take the supremum over 9 £ C C A with C a compact subset and 
then choose c{9): bEZe~^'^ = c{9). 

Proof of Theorem 6.6. For the proof we will use Lemma 6.3 and 
obtain 

(44) 



e'^'^''''-''"''-'^e-'^Urj + o{l), 



\<TT 



where the error term is uniform for 9 £ C G A, a compact subset, and is 
uniform in /. We claim that 
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(a) 



27r 
1 



gT„(b£;e-9(-)-^e^''-2-l)g-i,y/„(c) 



<7r 



, + 0(1), n— ^00, 

V(27r)rfdetL'2^(-^(c)) 

with 0(1) uniform for c in any compact subset X C A*, and 
(b) 

y N {bEe-9W-2-l)/(6-l) 



(45) 



sup 

c&K 



< 00 



r((6Ee-^W-^)/(6-l)) 
for every K d K* compact. 

Since the functions in (45) are continuous in and then also in c, (b) follows 
immediately. 

The left-hand side of (a) is equal to 

^d/2^-T^A{-B{c))_}_ f ^rnA{-e{c)+irj)^-iiTl„ic) 

= r„^/2e--^(-^('=))(/i(r„) + /2(rO), 

where for t > 1 we set 
1 



hit) 



hit) 



tA{-e(c)+iv)^-ivMc)dT^ and 



1 



(2vr)'^ . '{7r>|r?|>7rt-l/3} 



dr]. 



For DAi-eic)) = bEZe-^'-''^^ = c we have with A = -6(0) + 



ir] 



(46) A(A) = Ai-eic)) +ic-r]- i(L>M(-0(c))r?) • r? + iB\i]\^ 

for — 7- where B = i?(A) is uniformly bounded for r/ — ?■ and c€ K. With 
(46), substituting r] = fi/y/r^, and using 

-W2){D^A(-eic))t.yf. (^27r)''idetD''Ai-eic)))-'/\ 



we obtain using Lemma 2.2 and (46), that 
r„^/^e--^(-^W)/i(r„) 
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gi(Ai/v^)-(c-r'n-[cr„+o(l)]) 



^ ^^(l/2)(D^A{~e{c))p,)■^,^iB\^,\^r-^'^ 

^ +0(1), 



V^(27r)^detL>2A(-^(c)) 

with 0(1) uniformly going to zero for cG K. 

Next we show |f'^/^e~*'^(~^'''^^^/2(t)| = o(l) uniformly for all c in a compact 
subset of A* . We have 

J{7r>|r?|>7rt-l/3} 

with 

q{r]) := A(-^(c)) - Re(A(-0(c) + if])) 

= i{D''A{-e{c))v)-v{l + o{l)), ri^O, 

and 0(1) uniformly for all c in a compact subset of A*. Additionally, if 
r/i, . . . ,rjm are those values with < |r/| < vr and r] ■ Z G 27rZ almost surely, 
we have 

q{7]) = Ai-e{c)) - Re{A{-e{c) + 
= bEe~^^^>^{l-cos{r]- Z)) 

= hEe~'^''>^\\ii - r?,f (1 + 0(1)), r? ^ r?,-, J = 1, . . . ,m. 

Now note that Re(74(A)) < A{9{c)) for vr > |r/| > (5, / rjj,j = l,...,m, for 
all > 0. We can therefore choose e > so small and independently oi c£ K 
so that for all tt > |r/| > TTt~^^^ it follows that 

> min|^^^:^^^^vrV/^^^6i?e~^(^^^^ 

for some suitable constant C > and t sufficiently large (resp. n if t = Tn). 
It follows |\/te-*^(-^W)/2(i)| =0(1) and also | VTne-"'"^(-^(^))/2(rn)| = o(l) 
with the error term as claimed. We obtain from Theorem 5.8 and (44), 
choosing t = Tn, I = ln{c), 

= (T^oo(0(c))+o(l)) 

r(l/(b - 1)) {Y/jb - l))(fei^e~^M-^-l)/(fe-l) 

r((6Se-%)-^)/(6-l)) ^(27r)'^detZ)2A(-^(c)) 
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With (18) we have 



gT„(fe£;e-eW-2-l) 



Y N (feSe-''W-^-l)/(fe-l) 
b~) 

~a.s.n(^^-'^'^'-"-i)/(^-i)(l + o(l)), 
and we obtam almost surely with t„ ~a.s. ^grx^i''^ — ^ oo, 



e 



Hc)Mc) ^(27rr„,)^ det D^A{-B{c)) r((6Se-^W-^)/(6 - 1)) 



^(6Ee-SW-^-l)/(6-l) 



e-e(c)./„{c) ^(27rlog - l))"^ det D^A{-e{c)) 



r(l/(6-l)) 



r((6£;e"^W-^)/(6-l)) 
=:A,(n). □ 

Remark 6.10. Consider the case d=l and let Z be bounded. Define 

f{z) = 1 - bEz^ + \og{z)hEZz^ 

for z > 0. Then it follows immediately that A = {6' G ]R:/(e~^) < 0} [cf. 
Remark 5.7]. Since f'{z) = ^{log{z)bE z^ ) = <^ z = 1 we have a single 
local minimum {f"{z) > 0) at the point z = 1 with /(I) = 1 — 6. 

Further, for Z >0 a.s. we have 

lim /(z) = 1 - 6po, lim/(z) = oo. 

2\0 Z-^OO 

Consequently, we have one root of / if po > and otherwise two roots of /. 
In the first case, let zq = and let zi be the root of /. In the second case, let 
zq < z\ be the two roots of /. In both cases we have A = (— log(zi), — log(zo)) 
[where we set — log(O) :=oo]. 

7. Examples. Note that most of the following examples are taken from 
Broutin and Devroye (2006). 

In order to simplify notation and to work out various connections to 
known results, in the case d= \ and for z G R"*", we use M^ai^z) := VFoo(— log(z)) 
instead of Woo(A) [cf. (12)]. Set 



V 
V 

V* 



{e-^AG A}, 
F n M and 

{bE{Zz^):zeV} = A*. 



In complete analogy to Corollary 6.8 we have the following: 
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Corollary 7.1. Let d=l and let K be a compact subset of 
V* ■.= {bE{Zz^):zeV}. 

Then almost surely 

Ui{n) 



lim sup 

^^°°l:{b~l)l/logn£K 



Mooizi,n) 



0, 



where zi^n is chosen so that 

logn 

r(i/(6-i)) 

^(.-iWio,„(n) :- _ ) TibEzfJib - 1)) 



and 

where A/qq . . . , (z) are independent, distributed as Moo{z) and 

independent of (U^^^) where (U^^^) is defined in (26). 

Example 7.2 (Random binary search tree). A random binary search 
tree can be built incrementally. Let Ui, . . . ,Un be independent random vari- 
ables uniformly distributed over the unit interval. We start the tree by stor- 
ing Ui in the root node. If U2 is greater than Ui, we add a right child to the 
root and store U2 in that node. If U2 is less than Ui, we add a left child to 
the root and store U2 in that node. Then we repeat that procedure incre- 
mentally for U3, . . . ,Un- The nodes where we stored some Uj for some j are 
called internal nodes. We refer to Devroye (1991), Devroye (1998) and the 
references given there for the construction of binary search trees. A summary 
of known results about binary search trees is given in Mahmoud (1992b) and 
Knuth (1998). 

Let 7^ be a random binary search tree with n (internal) nodes. We will 
only consider complete binary search trees. That means that we add n + 
1 external nodes to each binary search tree with n internal nodes in the 
following manner. If u is an internal node and has no offspring, we add two 
external nodes as its potential children to it. If it has already one child, then 
we add one external node to u as a second potential child. If u has already 
two children, we add nothing. Note that every external node corresponds to 
one of the free places available for the sorting of a new internal node and 
that each free place is likely to be chosen next with equal probability. 
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It is well known that for the random binary search tree Tn we have 

\m.in{Du.uedTn]\ | max{i:>„ : n G 07^}| 
hm = a_, hm = 0+) 

rn>oo logn rn>oo logn 

where a_ , a+ are the only nonnegative solutions of the equation x log | — x + 
2 = 1 [see, e.g., Devroye (1986, 1987, 1998) and references given there]. Chau- 
vin et al. (2005) proved the following result, which is covered by Theorem 
6.6: 

Theorem 7.3 [Chauvin et al. (2005)]. Almost surely, for any compact 
subset K of {a-,a+), 



( Uijn) / / 



lim sup - Moo TT, = 0- 

''^'^i:i/iogneK\EUi{n) \2\ognJJ 

The profile of the binary search tree was also studied with other methods. 
Let a denote the limit ratio of the level and the logarithm of the tree size. 
Then, Fuchs, Hwang and Neininger (2006) proved convergence in distribu- 
tion for a G y* = (a_,a+) and for a G [1,2] convergence of all moments to 
prove their results. They used the contraction method and the method of 
moments. 

Drmota, Svante and Neininger (2008) treated a class of generalized m-ary 
search trees including binary search trees. For those trees they proved that 
in a certain range the normalized profile converges in distribution (Theorem 
1.1). They used arguments based on the contraction method in order to 
prove convergence in distribution of several random analytic functions in a 
complex domain. 

Example 7.4 (Random recursive tree). A random recursive tree is built 
inductively. The tree Ti consist of a single node 0, the root. Let %% already 
exist and consist of the nodes {vi, . . . , Vn}- To grow the tree choose a node Vj 
out of the set {vi, . . . , Vn} uniformly and at random, and attach the new node 
Vn+i as a child to node Vj [cf. Smythe and Mahmoud (1994) and references 
given there]. 

Fuchs, Hwang and Neininger (2006) showed that the profile of the random 
recursive tree normalized by its mean converges in distribution if the limit 
ratio a of the level and the logarithm of the tree size lies in [0, e). They also 
showed convergence of all moments to hold for a G [0, 1]. Furthermore, they 
proved that inside the interval (l,e) only convergence of a finite number of 
moments is possible. Drmota and Hwang (2005a) showed that the variance 
of the profile Ui{n) of the random recursive tree asymptotically undergoes 
four phase transitions and exhibits a bimodal behavior in contrast to the 
unimodality of the expected value of the profiles (cf. comments made on this 
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topic in the Introduction). For / around the most numerous level (where the 
width is attained), the value of the martingale shall be a.s. constant; more 
precisely one has Moo{l/ logn) = 1 almost surely [cf. Drmota and Hwang 
(2005a, 2005b)]. In the sequel, Drmota and Hwang (2005b) sketched that 
Ui{n) ~a.s. Mao{a)EUi{n) almost surely, where a = lim„ G [0, 1), using a 
martingale argument of Chauvin, Drmota and Jabbour-Hattab (2001) and 
Cauchy's integral formula. 

We will show below as an application of Theorem 6.6 that the profile of 
the random recursive tree normalized by its mean converges almost surely if 
the limit ratio a of the level and the logarithm of the tree size lies in (0, e). 
Additionally the profile converges uniformly for a in any compact subset of 
(0,e). 

First note that it is possible to interpret a random recursive tree with n 
internal nodes as a weighted binary tree TV^ ^ with n — 1 internal nodes by 

weighting the edges with independent copies oi Z = Bernoulli(^) and finally 
by interpreting the n external nodes of the latter as the internal nodes of the 
former. We have to choose Z2 = 1 — Zi. This follows immediately since every 
external node in the weighted binary tree is equally likely to be the next 
one to die and to get two external children where in the recursive tree each 
internal node is equally likely to be the next one to produce an offspring. For 
more details on the construction we refer to Broutin and Devroye (2006), 
Section 4.2. 

With this construction it is clear that not only is the height treated in 
Broutin and Devroye (2006), but also the distribution of the profile is kept 
by this construction. By embedding the random recursive tree (7^)n>i in 
the weighted tree process and by identifying {Tr„_^)n>i with {Tn)n>i-, we 
deduce the following: 



Theorem 7.5. Let K C (0,e) be a compact subset. Then almost surely 

= 0, 



lim sup 

"^°°/://logn6i<: 



EUi{n) \logn 



and for z £ (0, e), 

Moo(z) = zU'M^^^,){z) + (1 - C/)^M^,(2)(z), 



where Moo (j)(z) = M^{z),U is uniform[0, 1] random variable and Mao{z), 
-^00,(1) (-2^); -^00,(2) (-2^) '^''^d U are independent. 

Proof. Obviously, using Corollary 7.1, we have: 

(1) v = {z>0:l-{l + z)<- log{z)z} = (0, e); 

(2) V* = {2E{Zz^) = z:zeV} = {0,e); 



FUNCTIONAL LIMIT THEOREM FOR THE PROFILE OF S-ARY TREES 35 
and we have zi^n = Then with 

A/\ogn{n-) 



r(l + // log n) (V log nf^Thd 
and 



r(l + //logn)V2^ 
we easily obtain 

%f4^=i+o(i). 

Finally, we note that Hwang (1995) showed that [see also 
Fuchs, Hwang and Neininger (2006), equation (3), page 2] 

which yields the theorem by using the Stirling formula. 

From Theorem 5.8, part (2)(b), using Zi = 1 - Za, ^/(i) + = 1 and by 
setting V := U^^\ we obtain 

Moo(z) = zC/^M^,(i)(z) + (1 - f/)^M^,(2)(z) 

and that the claimed independence relations also hold. For the distribu- 
tion of U note that if Ei,E2 are independent, exponentially distributed 
random variables, then ^^^^^ is uniform[0, 1] distributed. Now Y , defined 
after Lemma 5.2, is Gamma distributed with parameters (1, 1) which is the 
same as being exponentially distributed with expectation one. It follows that 
i = 1,2, is uniformly [0,1] distributed. □ 

Note that V* = (0, e) is the natural range for convergence, since, Devroye 
(1987) and Pittel (1994) showed for the height Hn of fn that 

lim 7- e. 

logn 



n—^oo 



So e should be the upper bound for any range of convergence of the profile. 

Example 7.6 (Random lopsided trees). Prefix-free codes are particu- 
larly interesting because they can be decoded directly by following a path in 
a tree and output a character corresponding to the codeword when reaching 
a leaf. Each node u represents a prefix p and its children represent the words 
that can be built by appending a symbol to p. When reaching a leaf, one 
obtains a character corresponding directly to the codeword. 
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Some codes have encoding length depending on the symbols. These codes 
are called Varn codes [cf. Varn (1971)] and naturally lead to lopsided trees. 
Lopsided trees are trees with edges having nonequal length. We refer to 
Broutin and Devroye (2006) for further details, especially on the height 
of such trees and for further references. There are no results about the 
asymptotic behavior of the profile of random lopsided trees yet. 

Let ci < C2 < • • • < Cfe be fixed positive integers. A tree is said to be lop- 
sided if it is 6-ary rooted and for each node, the edge to its jth child has 
length Cj, l< j < b. 

A random lopsided tree can be constructed incrementally in the following 
way: The tree 71 consists of a single internal node 0, the root. Additionally, 
we attach b external children to the root node. If Tn already exists, take an 
external node uniformly and at random and replace it by an internal node. 
The weights of the edges from that internal node to its b external children 
are ci,. . . ,Cb- It is clear that Z in the weighted 5-ary tree framework has to 
be chosen as Z = cw where is a uniform distributed random variable on 
the set {1, . . . , b}. 

Then, with Corollary 7.1 and Remark 6.10, by embedding the lopsided 
trees in the 6-ary tree model, and by identifying (T^-^) = (Tn) we have the 
following result for the profile {Ui{n)): 

Theorem 7.7. If K is a compact subset of V* := {Z]j=i Cj-z'^^ :z eV} 
with V = (zo,zi) where zq,zi are defined in Remark 6.10, then, almost 
surely, 

Ui{n) 



lim sup 

"^°°Z:(fe-l)Vlogn6-ft: 



Moo(Zi,„) 



0, 



^{6-l)//logn("') 

where zi^n is the solution z ofY^^j=iCjZ^i = -^^^g^, and 

^..^:„-i)/(j>-i) r(i/(6-i)) 



\b-l)l /logni'n) 



„^2vr log(n)l/(6 - 1) Y.U ^^^n ^^^^U Kn)l^^ - 
We have 



b 



Moo(.)=5^z^^(^0-))V(^-^)(^°-^^^-^)M^,(,)(.) 



with Moo{z) = M^(j\{z),j = l,...,b, U^^^ = ^ , where Yj are i.i.d. 
random Gamma(j^,^^) distributed random variables, and Moo(z), 
Moo^(i)(z), . . . , Moo^(h)(z), {U^^\ . . . ,11^''^) are independent. 
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Example 7.8. Consider the following tree model. Start with a single 
internal node. At each step the tree is expanded by choosing uniformly 
and at random an internal node out of the tree and then by replacing it 
with a given deterministic tree T* . This model can be described by the 
model of lopsided trees. Assume \T* \ = k. Imagine a lopsided tree in which 
each replaced node gives birth to k children with edge weights equal to 
the distances of the nodes in the tree T* to the root of T* . The internal 
profile can now be calculated using the external profile of the corresponding 
lopsided tree. 

Example 7.9 (Plane oriented and linear recursive trees). Plane ori- 
ented trees (PORTs) are rooted trees in which the children of every node 
are oriented. The depths of nodes in random PORTs have been studied by 
Mahmoud (1992a) and their height by Pittel (1994). PORTs can be built 
recursively; start with one single node, the root. If Tn already exists, add 
node Vn+i uniformly and at random in one of the slots available. The slots 
are the positions in the tree that lead to different new trees. One can think of 
the slots as external nodes that are placed before, between and after internal 
nodes. So a node with k>l children has k + 1 external nodes attached to it, 
always one external node between two (internal) children and one in front 
of the first (internal) child as well as one after the last (internal) child. If an 
internal node has no children, then we attach one external node to it as a 
potential child. 

A more general model of recursive trees is based on Pittel (1994). In these 
recursive trees each node u has a weight Wu- When growing this kind of tree, 
a new node is added as a child of node u with probability proportional to 
Wu- Now Wu '■= ^ + /3deg(n), where deg(n) denotes the number of children 
of u and /? > 0, is called the parameter. When f3 is an integer, we can use the 
general tree model of Broutin and Devroye (2006) to describe those trees. 
Let /S G N and 7^ be such a random recursive tree with parameter /3 and 
with n internal nodes where Tf = {0} consists of a single node, the root. The 
tree is expanded by adding a child to node u with probability proportional 
to 1 + /3deg(n). Alternatively we can choose an external node uniformly and 
at random where we attached to each internal node u deg(ii)/3 + 1 external 
nodes. So when we pick an external node at level d and replace it by an 
internal node, we attach /3 + 2 new external nodes to the tree, /3 + 1 on level 
d and one at level d+1. 

Now consider (/3 + 2)-ary weighted trees {Tt)t>o where the tree process is 
stopped when having n internal nodes. When choosing an external node uni- 
formly and at random from the set of all external nodes and when replacing it 
by an internal node, we add /3 + 2 external nodes to that new internal node 
with weights Z = (Zi, . . . , ^/3+2) where = 0, 1 < j < /3 + 1, Z(/3+2) = 1 
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(the brackets in the index means that the weights are ordered by increasing 
values). The external profile of that tree has a similar distribution as the 
external profile of the random recursive tree with parameter /3. Let Ui{n)^ 
be the number of external nodes in the tree 7^ on level I and Ui{n) be the 
number of external nodes in the tree Tn = 7V„ , the corresponding weighted 

(/3 + 2)-ary tree. Then Ui+i{n + 1)^ = Ui{n). Note that for /3 = we obtain 
the random recursive tree of Example 7.4 and for /? = 1 the so called PORTs. 

We can choose Z = Bernouni(-gi2) with \{j < /3 + 2: Zj = 0}\ = /3 + 1. 

Before formulating the convergence theorem for these recursive trees with 
parameter /? S N we remark that the profile of plane-oriented recursive trees 

= 1) was analysed by Hwang (2005). For a G [0, |] he obtained conver- 
gence in distribution and of all moments of the normalized profile, where 
the limit is uniquely characterized by its moment sequence. Hwang (2005) 
presented no solution for the problem of convergence for a ^ [0, ^] since for 
a ^ [0, ^] only convergence of a finite number of moments is possible. As 
a consequence, the characterization of the limit via moments is not possi- 
ble. In addition, no fixed point equation had been known until now. Hwang 
(2005) anticipated convergence in distribution of the normalized profile for 
a E (^, a*) where a* is the solution of \ + z + 2:log(2z) = 0. 

We show here even more; namely, we prove uniform almost sure conver- 
gence for a in any compact subset of (0,a*). Note also that our construction 
also shows how the tree could be split into subtrees in order to use the con- 
traction method. We identify (T^)n>o = (^T„_i)n>o where the trees {T^)t>o 
are defined as the trees (Tt)t>o with the root node (resp. the imaginary edge 
eo to the root node) having itself the weight 1. Then we obtain 

[u£dT;:Du= Z, = l + l\ 

^ ee7r(w) ^ 

Note that eo S Tr{u) for all u. Finally, it follows that 

Ui+^{n+lf = p*{l + l)=PrSl). 



pl{l + l):-- 



■Ptil). 



Theorem 7.10. Let K £ (0,zo) be a compact set where zq is the only 
solution of zlog{z) — z — (3 = 0. Then almost surely 



lim sup 

"-^°°;:(/3+l)//lognei<' 



with 



A 



(/3+l)Z/logn("-j 



^(b-l)l/logn{n) 
^(/3+^;.n)/(/3+l) r(l/(/3 + l)) 







^2^1 T{l + zi,n/{/3 + l)y 
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where zz,„ := (/5 + Further 

j=l \ j=l J 

where (^^(z), . . . , (•^_|_2)(z) are independent, distributed as Moo{z) 
and independent of {U^^\ . . . ,17^/^^^'^) with = — where 

y^- = Gamma(-^^, -g^) i.i.d. 

Remark 7.11. For f3 = 1, zq is the only solution of ^ + | — | log(z) = 0. 
So / has to be chosen so that G (0,zo/2). Obviously zo/2 is the only 
solution of ^ + z — zlog{2z) = 0, zo/2 = a*. 

Proof. We have b = p + 2, Z = Bernouni(-gi2), bEz^ = l3 + l + z. It 
follows that 

V = {z>0:l- bEz^ < - log{z)bE{Zz^)} 
= {z >0: zlog{z) - z - l3 < 0}. 
Since bE{Zz^) = z we have V = V* . Define f{z) := 2;log(z) — z — (3. Then 

f'{z) = log{z), nz)>0, 
and (1, —1 — /3) is a local minimum of /. Since 

lim f{z) = -/3, lim f{z) = oo, 

there is only one solution of f{z) = that we will call zq. In the interval [0, zq) 
f is negative other than that nonnegative. The rest follows from Corollary 
7.1. □ 

Note that independently of this work Sulzbach (2008) proved a functional 
limit theorem for the profile of plane oriented recursive trees using the mar- 
tingale method. 

Example 7.12 (Changes of direction in a binary search tree). Let Tn 
be a random binary search tree with n internal nodes, and let n G 7^. Define 
Dn{u) := Dn{7r{u)) as the number of changes of direction in 7r{u) where tt{u) 
is the path from the root to node u. Now let and 1 encode a move down 
to the left and to the right, respectively. For example the path encoded 
by 1001010110 will have D = 7, that is, a count of each occurrence of the 
patterns 01 and 10. 
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We are interested in Di{n) := \{u G dTn ■ Dn{u) = l}\. Broutin and De- 
vroye (2006) introduced the following labelling of the edges: for each level 
l>2 of edges form the word (0110)'"^, and map the binary characters to 
the edges from left to right. Call this weighted binary tree Tn- Then, by 
embedding, we find that Di{n) = |{n G dTn ■ = Consequently choose 

Z = binomial(^), Z2 = 1 — Zi, and obtain the following as in the random 
recursive tree case: 



Theorem 7.13. Let K C (0,e) he a compact subset. Then almost surely 

= 0, 



lim sup 



where 



EDi{n) ylogn^ 

Moo(c) = cu'MM + (1 - urM:^{c), 



where M^{c) = Ml^{c), U is a uniform[0, 1] random variable and Moo{c), 
M^(c) and U are independent. 

Example 7.14 (Random /-colouring of the edges in a tree). Take a ran- 
dom binary search tree and randomly color the edges with one of / different 
colors. We can think of different problems in that framework. For instance, 
we could be interested in the question how many nodes u have exactly I red 
edges in tt(u) if color red appears with probability p. For this problem we 

have to choose Z = Bernoulli (p), since we count only red edges which we 
mark with Z = 1 and all other colored edges with Z = 0. Let Dn{l) be the 
number of nodes in the tree Tn with exactly / red edges in 7r(n). 

Theorem 7.15. Let K C {2zp:z G V} with V = {z > 0:2pzlog{z) - 
2zp + 2p — 1 < 0}. Then almost surely 

Dn{l) 



0, 



lim sup 

"^^"^ 1:1 /logneK ^//lognl'^J 

where we set z = zi^n ■= 2pLgn "^'^ 

.n \ 

z[^V2^l r(2(l-p) + 2pz,,„))' 
Moo(z) = z^^C/i-2^'+2p^M^,(i)(z) + z^^{l - U)'-^P+^P^M^^(^2){z), 
where M^^(^i^{z), M^^(^2){z) o,'"'^ independent, distributed as Moo{z), inde- 
pendent of Zi,Z2,U where U = uniform [0, 1] and Zi,Z2 are independent, 
identically distributed with Bernoulli (p) distribution. 
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Proof. Since 6 = 2, hEZz'^ = 2zp, hEz^ = 2(1 - p + zp), we have V = 
{z>0:2pzlog(z)-2zp + 2p-l<0} and V* = {2zp: z eV] . □ 

For the random recursive tree the number of nodes with paths having 

exactly / red edges can be analyzed taking Z = Bernoulli(p) x BernouIli(l/2), 

thus having Z = Bernoulli(p/2) [cf. Example 7.4]. The random recursive tree 
can be interpreted as a weighted binary tree. Now randomly color the edges 
of this tree. The probability of having a red edge is now p and the probability 
of having an edge with weight 1 is 1/2. This model could alternatively be 

analyzed in a 2-dimensional weig hted model where Z = (Z^^), Z^^)), Z^^) = 

Bernoulli (1/2), Z^^^ = Bernoulli (p) where Z^^^Z*-^-* are independent. 

Example 7.16 (The left minus right exceedance). Let 7^ be a binary 
search tree with n internal nodes and let u € 7^. Define Du := Ylie&-K(u) i^i^) ~ 
R{e)) where L{e) is the indicator function of e being a left edge and R{e) is 
analogously the indicator of e being a right edge. We are interested in 



namely, the number of external nodes in our binary search tree which have 
exactly I more left edges than right edges in the path from the root leading 
to that external node. Naturally in the framework of weighted 6-ary trees 
we choose 6 = 2 and mark all right edges with —1 and weight all left edges 
with 1. Since right and left edges are equally likely to be chosen, we use Z 
with P{Z = 1) = 1 = P(Z = -1). 

Theorem 7.17. Let K be a compact subset of [zq — -^,zi — ^) where 
< zq < zi are the two positive solutions of 



Ui{n):=\{uedTn:Du = l}\ 



1 — z \- z log(z) log(z) = 0. 



Then almost surely, 




where we define 



1 



n 



A/ log 



n 



+ ^/Zl^n) z\^-^2'K{zi^n + l/-ZZ,n) log(ri) 



with 
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Further we have 

where (^^(z), Moo,(2)(-2) are independent, distributed as Moo{z) and in- 
dependent of U = uniform[0, 1] . 

Proof. First note that 

V = {z>0:l- bEz^ + \og{z)bE{Zz^) < 0} 

z>0:l- (^z + ^^ + log(z) (-2 - < o| . 
Easily we obtain for z > with g{z) := 1 — {z + ^) + log(z)(z — ^) 
5'(z)=log(z)(^l + ^^ =0 ^ z = l, 
/(1)>0, 

that g has a single local minimum at z = 1 with g{l) = —1 and since g is 
continuous on (0,oo) with 

limg{z)=oo, lim g{z) =oo, 

there are exactly two roots of the function g on (0, oo). Call them < zq < zi. 
Now V* :={z-l:ze (2:0,2:1)} = (20-^,21- ^). If c:= j^^, then choose 

z = 2(c) >0:z-^ = c^z'^-cz-l = 0^z = ^ + y^(§)2 + 1. Prom this the 
proof follows. □ 

Example 7.18 (Stochastic models for the web graph). We give a new 
example not contained in Broutin and Devroye (2006). The web may be 
viewed as a directed graph in which each vertex is a static HTML web page, 
and each edge is a hyperlink from one web page to another. Kumar et al. 
(2000) proposed and analyzed a class of random graph models inspired by 
empirical observations on the web graph. These observations suggested that 
the web is not well modeled by traditional graph models. 

The linear growth copy model of Kumar et al. (2000) is parameterized 
by a copy factor a £ (0, 1) and a constant outdegree d>l. Only the choice 
d = l results in a random forest that might be turned into a tree and studied 
using our framework. 

We start with one single vertex. Assume that the random forest Tn with 
n internal nodes has already been created. At each time step, one vertex 
u is added by the following procedure: from the tree Tn choose a vertex 
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uniformly and at random. Call this vertex v. With probability a we attach 
node M as a child to node v. With probability 1 — a the node u becomes a 
brother of node v; that means that we attach node child to the father 

of node v. If node u is a root with no ancestors, we let u be an isolated 
node, namely the root of a new tree consisting of that single node. We could 
now ask how many nodes are roots, nodes on level 1,2,... and so on in that 
random forest. 

We can interpret this random forest as a binary tree with weighted edges. 
When raising the forest we may instead raise the binary tree as follows. In 
the random forest a new node u is attached by choosing uniformly and at 
random an internal node v out of the existing forest Tn- In the binary tree 
Tn-i = Tt-„_i we will instead choose an external node, call it v, uniformly 
and at random from one of the n external nodes. With the probability a, the 
new node in the forest will be a child of node v and located one level below 
V. We transmit this by making the external node v in the binary tree an 
internal one and attach two new external nodes to v, one with edge weight 
0, representing v, and the other with edge- weight 1, representing u. With 
the probability 1 — a, node u becomes a brother of v, that means it stays 
on the same level as node v. Then we will replace the external node v in 
the binary tree by an internal node and attach two new external nodes to 
it, one with edge weight 0, representing v, and the other with edge- weight 
0, representing u. Then an arbitrary edge has weighted one with probability 

a/2 and otherwise it has weight zero. Choose Z = Bernoulli(^) and the 
weights Zi, Z2 attached to the root of the binary tree as follows. Let Zi = Z 
and Z2 = l{Zi=o}y with Y = Bernoulli (g^^) and being independent of Zi. 

Then Z2 = Z and the resulting tree grows as the tree described above. 
By embedding we obtain the following: 

Theorem 7.19. Let K be a compact subset of V* := {azo,azi) with 
zq,zi the two roots of the function f{z) = —1 + a + az{l + log(z)). Then 
almost surely 




with 



n 



1— a+i/logn 



1 




and 



Moo{z)=Z' 



1 — Q+C 



^^oo,(l)W + ^' 



(l-U) 



1—a+c 
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where (•2)(2) are independent, distributed as Moo{z) and in- 

dependent of U where U = uniform[0, 1] . 

Example 7.20 (Combination of weights). Higher dimensional weights 
can be used to describe all the trees studied earlier with additional weights 
attached to the nodes or, alternatively, edges. For example we can study 

a 2-ary tree with Z = Z^^) = 1 and Z^^) £ binomial(i) which 

refers to a random binary search tree with edges marked with zero or one. 
We can think of a situation where we use the second weight for identifying 
if the ancestor passes some attribute on to its child (= 1) or not (= 0). Let 



Applying Theorem 6.6 (resp. Corollary 6.8) we can then describe the asymp- 
totics of that numbers. 
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